34 research outputs found

    A review of probabilistic forecasting and prediction with machine learning

    Full text link
    Predictions and forecasts of machine learning models should take the form of probability distributions, aiming to increase the quantity of information communicated to end users. Although applications of probabilistic prediction and forecasting with machine learning models in academia and industry are becoming more frequent, related concepts and methods have not been formalized and structured under a holistic view of the entire field. Here, we review the topic of predictive uncertainty estimation with machine learning algorithms, as well as the related metrics (consistent scoring functions and proper scoring rules) for assessing probabilistic predictions. The review covers a time period spanning from the introduction of early statistical (linear regression and time series models, based on Bayesian statistics or quantile regression) to recent machine learning algorithms (including generalized additive models for location, scale and shape, random forests, boosting and deep learning algorithms) that are more flexible by nature. The review of the progress in the field, expedites our understanding on how to develop new algorithms tailored to users' needs, since the latest advancements are based on some fundamental concepts applied to more complex algorithms. We conclude by classifying the material and discussing challenges that are becoming a hot topic of research.Comment: 83 pages, 5 figure

    Ανασκόπηση των μεθόδων ανίχνευσης αντισωμάτων έναντι του SARS- Cov-2

    Get PDF
    • Ο κόσμος σήμερα έχει συσπειρωθεί όπως ποτέ άλλοτε για να πολεμήσει έναν κοινό εχθρό για τους ανθρώπους: την πανδημία του κορονοϊού. Οι CoVs είναι υπεύθυνοι για το σοβαρό οξύ αναπνευστικό σύνδρομο (SARS) που είχε πρωτοεμφανιστεί ως επιδημία τον Νοέμβριο του 2003. Σχεδόν δύο δεκαετίες αργότερα, ένας νέος κορονοϊός, ο κορωνοϊός SARS 2(SARS-CoV-2), που προήλθε από την πόλη Wuhan της Κίνας τον Δεκέμβριο 2019, οδήγησε σε μια άνευ προηγουμένου παγκόσμια πανδημία, αυτή της νόσου του κορωνοϊού 2019 (COVID-19), που έχει αποδειχθεί μια σοβαρή πρόκληση για την υγεία ολόκληρου του πλανήτη. Το επίκεντρο της καταπολέμησης του SARS-CoV-2 περιστρέφεται γύρω από τον εντοπισμό κρουσμάτων, την παρακολούθηση, την πρόληψη της μόλυνσης, τη διάγνωση και την υποστηρικτική φροντίδα. Προηγούμενες μελέτες σε πολλές ιογενείς λοιμώξεις υπέδειξαν ότι τα προστατευτικά αντισώματα δεν έχουν απαραίτητα εξουδετερωτική δραστηριότητα. Έτσι, το δυναμικό των ανθρώπινων αντισωμάτων στη θεραπεία ή στον εμβολιασμό εξαρτάται από το αν τα αντισώματα έχουν ρόλο στην εξέλιξη της νόσου ή στην προστασία από την ιογενή λοίμωξη. Στην παρούσα εργασία αναφέρονται οι ανοσοδοκιμές ανίχνευσης αντισωμάτων για τον Covid-19. Με τις εξής μεθόδους : i) μέθοδος LIPS ii) μέθοδος CLIA iii) μέθοδος CMIA iv) μέθοδος ELISA v) με το Liaison για το SARS -CoV-2 S1/S2 IgG ,οι μεθοδολογίες και τα αποτελέσματα αυτών. Γίνεται σύγκριση των κιτ SARS-CoV-2 IgG CMIA Alinity system and Liaison SARS-CoV-2 S1/S2 και αναφορά για τη δοκιμή του τεστ της EUROIThe world today is united as never before to fight a common enemy of humans: the coronavirus pandemic. CoVs are responsible for Severe Acute Respiratory Syndrome (SARS), which first emerged as an epidemic in November 2003. Nearly two decades later, a new coronavirus, SARS-CoV-2 (SARS-CoV-2), originated in Wuhan. China in December 2019, led to an unprecedented global pandemic, that of coronavirus 2019 (COVID-19), which has proven to be a serious health challenge for the entire planet. Its focus SARS-CoV-2 control revolves around case detection, monitoring, infection prevention, diagnosis and support care. Previous studies in many viral infections have suggested that protective antibodies have no neutralizing activity. So, the potential of human antibodies in treatment or vaccination depends whether antibodies play a role in disease progression or protection against viral infection. The present study reports antibody detection immunoassays for Covid-19. By the following methods: i) LIPS method ii) CLIA method iii) CMIA method iv) ELISA method v) with Liaison for SARS -CoV-2 S1 / S2 IgG, their methodologies and results. Compare the SARS-CoV-2 IgG CMIA Alinity system and Liaison SARS-CoV-2 S1 / S2 kits and report on the EUROIMMUN test

    Ensemble learning for blending gridded satellite and gauge-measured precipitation data

    Full text link
    Regression algorithms are regularly used for improving the accuracy of satellite precipitation products. In this context, ground-based measurements are the dependent variable and the satellite data are the predictor variables, together with topography factors. Alongside this, it is increasingly recognised in many fields that combinations of algorithms through ensemble learning can lead to substantial predictive performance improvements. Still, a sufficient number of ensemble learners for improving the accuracy of satellite precipitation products and their large-scale comparison are currently missing from the literature. In this work, we fill this specific gap by proposing 11 new ensemble learners in the field and by extensively comparing them for the entire contiguous United States and for a 15-year period. We use monthly data from the PERSIANN (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks) and IMERG (Integrated Multi-satellitE Retrievals for GPM) gridded datasets. We also use gauge-measured precipitation data from the Global Historical Climatology Network monthly database, version 2 (GHCNm). The ensemble learners combine the predictions by six regression algorithms (base learners), namely the multivariate adaptive regression splines (MARS), multivariate adaptive polynomial splines (poly-MARS), random forests (RF), gradient boosting machines (GBM), extreme gradient boosting (XGBoost) and Bayesian regularized neural networks (BRNN), and each of them is based on a different combiner. The combiners include the equal-weight combiner, the median combiner, two best learners and seven variants of a sophisticated stacking method. The latter stacks a regression algorithm on the top of the base learners to combine their independent predictions...Comment: arXiv admin note: text overlap with arXiv:2301.0125

    Machine learning for uncertainty estimation in fusing precipitation observations from satellites and ground-based gauges

    Full text link
    To form precipitation datasets that are accurate and, at the same time, have high spatial densities, data from satellites and gauges are often merged in the literature. However, uncertainty estimates for the data acquired in this manner are scarcely provided, although the importance of uncertainty quantification in predictive modelling is widely recognized. Furthermore, the benefits that machine learning can bring to the task of providing such estimates have not been broadly realized and properly explored through benchmark experiments. The present study aims at filling in this specific gap by conducting the first benchmark tests on the topic. On a large dataset that comprises 15-year-long monthly data spanning across the contiguous United States, we extensively compared six learners that are, by their construction, appropriate for predictive uncertainty quantification. These are the quantile regression (QR), quantile regression forests (QRF), generalized random forests (GRF), gradient boosting machines (GBM), light gradient boosting machines (LightGBM) and quantile regression neural networks (QRNN). The comparison referred to the competence of the learners in issuing predictive quantiles at nine levels that facilitate a good approximation of the entire predictive probability distribution, and was primarily based on the quantile and continuous ranked probability skill scores. Three types of predictor variables (i.e., satellite precipitation variables, distances between a point of interest and satellite grid points, and elevation at a point of interest) were used in the comparison and were additionally compared with each other. This additional comparison was based on the explainable machine learning concept of feature importance. The results suggest that the order from the best to the worst of the learners for the task investigated is the following: LightGBM, QRF, GRF, GBM, QRNN and QR..

    Deep Huber quantile regression networks

    Full text link
    Typical machine learning regression applications aim to report the mean or the median of the predictive probability distribution, via training with a squared or an absolute error scoring function. The importance of issuing predictions of more functionals of the predictive probability distribution (quantiles and expectiles) has been recognized as a means to quantify the uncertainty of the prediction. In deep learning (DL) applications, that is possible through quantile and expectile regression neural networks (QRNN and ERNN respectively). Here we introduce deep Huber quantile regression networks (DHQRN) that nest QRNNs and ERNNs as edge cases. DHQRN can predict Huber quantiles, which are more general functionals in the sense that they nest quantiles and expectiles as limiting cases. The main idea is to train a deep learning algorithm with the Huber quantile regression function, which is consistent for the Huber quantile functional. As a proof of concept, DHQRN are applied to predict house prices in Australia. In this context, predictive performances of three DL architectures are discussed along with evidential interpretation of results from an economic case study.Comment: 31 pages, 9 figure
    corecore